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Bipolar disorder is characterized by impaired decision-making captured in impulsivity and risk-taking. We sought to determine 
whether this is driven by a failure to effectively weight the lower-order goal of obtaining a strongly desired reward in relation to 
higher-order goals, and how this relates to trait impulsivity and risk-taking. We hypothesized that in bipolar disorder the 
weighting of valuation signals converging on ventromedial prefrontal cortex are more heavily weighted towards ventral striatum 
inputs (lower-order), with less weighting of dorsolateral prefrontal cortex inputs (higher-order). Twenty euthymic patients with 
bipolar disorder not in receipt of antipsychotic medication and 20 case-matched controls performed a roulette task during 
functional magnetic resonance imaging. Activity in response to high-probability ('safe') and low-probability ('risky') prospects 
was measured during both anticipation, and outcome. In control subjects, anticipatory and outcome-locked activity in dorso- 
lateral prefrontal cortex was greater for safe than risky reward prospects. The bipolar disorder group showed the opposite 
pattern with preferential response to risky rewards. This group also showed increased anticipatory and outcome-locked activity 
in ventral striatum in response to rewards. In control subjects, however, ventromedial prefrontal activation was positively 
associated with both ventral striatum and dorsolateral prefrontal activity; patients evidenced a strong positive association 
with ventral striatum, but a negative association with dorsolateral prefrontal cortex. Response to high-probability rewards in 
dorsolateral prefrontal cortex was inversely associated with trait impulsivity and risk-taking in the bipolar disorder group. Our 
findings suggest that clinically impulsive and risky decision-making are related to subjective valuation that is biased towards 
lower-order preference, with diminished integration of higher-order goals. The findings extend a functional neuroanatomical 
account of disorders characterized by clinically impulsive decision-making, and provide targets for evaluating interventions that 
foster self-control. 
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Introduction 

Impulsive and risky decision-making is characteristic of several 
psychiatric disorders including substance dependence (Rogers 
et al., 2010), bipolar disorder (Swann et al., 2009), attention 
deficit hyperactivity disorder (Scheres et al., 2010), pathological 
gambling (Rogers et al., 2010) and psychopathy (Vitacco and 
Rogers, 2001). Impulsivity can be conceptualized as a diminished 
self-control to suppress behaviours that afford preferred (highly 
desirable) outcomes, but which contravene higher-order or 
longer-term goals and so are ultimately suboptimal in the long 
run (Hare et al., 2009; Diekhof et al., 2012). Common fronto- 
striatal regions are implicated in optimal versus impulsive decision- 
making, whether the decision involves selecting a tastier over a 
healthier food option (Hare et al., 2011), or a safe but low-return 
prospect over a risky but potentially more lucrative investment 
(Peters and Buchel, 2009). Here, we present recent developments 
on the functional basis of self-control in decision-making to 
examine suboptimal decision-making in bipolar disorder during a 
probabilistic task in which safe and risky reward prospects are 
evaluated. 

The functional basis of optimal 
decision-making 

Ventral frontostriatal regions have largely been implicated both in 
motivating behaviour towards obtaining desired outcomes — or 
rewards — and, subsequently, in the hedonic impact of rewards 
once obtained. The nucleus accumbens, part of the ventral stri- 
atum, has been shown to code both probability and delay features 
of rewards indiscriminately, pointing towards a common 'neural 
currency' (Peters and Buchel, 2009). This region responds 
preferentially to (i) low-probability rewards that are better- 
than-expected and so more subjectively gratifying (Yacubian 
et al., 2006; Smith et al., 2009); and (ii) immediate rewards 
over those that are superior in the longer-term (McClure et al., 
2004). In contrast, activity in dorsofrontal structures, most notably 
the dorsolateral prefrontal cortex (PFC), is associated with mediat- 
ing behaviour in the direction of safer over risky prospects 
(Campbell-Meiklejohn et al., 2008) and with delaying gratification 
in favour of superior but delayed rewards (McClure et al., 2004). 
Underlying the role of the dorsolateral PFC in decision-making is 
its central ability to detect and direct attention towards goal- or 
task-relevant stimuli (Banich et al., 2000; MacDonald et al., 
2000). 

Signals from ventral striatum and dorsolateral PFC converge in 
the ventral PFC, an area understood to encode an integrated valu- 
ation of prospects taking account of the trade-off between lower- 
order preference and longer-term profitability (Hare et al., 2009) 
and which cuts across both delay and probability domains (Peters 
and Buchel, 2009). Activation of the ventromedial PFC reflects 
subjective preference curves for different prospects (Kable and 
Glimcher, 2007) and predicts behavioural choice (Plassmann 
et al., 2006). 

Regulation of activity in the ventromedial PFC via the dorsolat- 
eral PFC has been demonstrated in tasks requiring self-control 



over behaviours that would secure strongly desired outcomes, 
but which are suboptimal in the long run. This includes, for ex- 
ample, selecting a healthier but non-preferred food option (Hare 
et al., 2009), or deciding not to chase increasingly risky rewards 
after experiencing losses (Campbell-Meiklejohn et al., 2008). The 
dorsolateral PFC is also recruited in non-choice contexts when 
subjects are instructed to cognitively reappraise a rewarding pro- 
spect or outcome, accomplishing this through the modulation of 
the ventral striatal signal in the ventral frontostriatal pathway 
(Staudinger et al., 2009, 2011). The functional outcome of this 
modulation is a differential weighting (in the ventromedial PFC) of 
the lower-order preference, mediated by ventral striatum, against 
higher-order goals represented in dorsolateral PFC activity. 

Decision-making in bipolar disorder 

Bipolar disorder is characterized by marked difficulty in regulating 
the pursuit of goals (Johnson, 2005), with the onset of manic and 
depressive episodes linked to the attainment and failure to obtain 
goals, respectively (Johnson et al., 2008). Although particularly 
elevated during mania, impulsivity represents a trait feature of 
the disorder (Strakowski et al., 2010) and there is evidence of 
altered frontostriatal processing of reward prospects and outcomes 
across mood episodes. A recent study of euthymic patients 
demonstrated hyperactivation of ventral striatum and ventro- 
medial PFC when anticipating rewarding outcomes (Nusslock 
et al., 2012), but no difference at outcome. Similarly in a non- 
clinical sample exhibiting subsyndromal hypomania, striatal activity 
during anticipation was more strongly modulated by the reward 
value of prospects, compared with control subjects (O'Sullivan 
et al., 2011). Other studies of this at-risk population have 
shown both a greater hedonic impact of reward outcomes gener- 
ally (Mason et al., 2012a) as well as a preference for immediate 
over delayed-but-superior rewards (Mason et al., 2012b), as 
indexed by an event-related potential originating from the ventral 
frontostriatal pathway (Carlson et al., 2011). 

The findings are not clear-cut for how these frontostriatal 
systems are modulated by affective state. One study of patients 
experiencing mania reported an elevated ventral striatal response 
to omission of reward outcomes, but no difference in response to 
expected reward outcomes (Abler et al., 2007a), although unex- 
pected rewards, which may be more valued, were not available in 
their design. The failure to deactivate ventral striatum for reward 
omission may point towards a reduced appreciation of the cost of 
these null outcomes on the higher-order goal of maximizing profit. 
In another study, patients experiencing mania showed hyperacti- 
vation of ventral prefrontal cortex given the prospect of increasing 
reward, but relatively reduced activation when faced with increas- 
ing loss (Bermpohl et al., 2010). Healthy controls showed the 
inverse pattern, indicative of an appreciation that larger losses 
contravene the higher-order goal of maximizing task earnings, 
whereas manic individuals were less concerned about this and 
instead were motivated by larger gains. 

Taken together, these studies indicate that goal dysregulation in 
bipolar disorder manifests neurally as a strong lower-order prefer- 
ence for rewarding prospects (Bermpohl et al., 2010; O'Sullivan 
et al., 2011; Nusslock et al., 2012) and reduced appreciation for 
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risky prospects that are suboptimal with regards to a higher-order 
goal (Bermpohl et al., 2010). The present study aimed to test 
whether impulsive and risky decision-making results from a valu- 
ation in ventromedial PFC that is biased towards strongly desired 
but risky prospects (mediated by ventral striatum) over those that 
are better in the long-run (dorsolateral PFC-mediated). To reduce 
medication effects, we recruited euthymic patients that were not 
receiving antipsychotics, which are likely to be the most 
problematic class for studying reward processing (Pessiglione 
et al., 2006; Abler et al., 2007/?). A second aim, in light of the 
equivocal findings in the above studies, was to examine how trait 
differences in frontostriatal regions are modulated by affective 
symptoms. 

To this end we assessed how activity in the dorsolateral PFC, 
ventromedial PFC and ventral striatum was influenced by safe and 
risky gambles, determined by probability of reward in a roulette 
task. Consistent with a role in coordinating the higher-order goal 
of maximizing overall winnings, we expected that the dorsolateral 
PFC would activate preferentially for safe gambles and that this 
would be negatively associated with real-life impulsivity and 
risk-taking traits. In contrast, we predicted that activity in ventral 
striatum would preferentially respond to risky gambles and that 
this would be positively correlated with trait impulsivity and 
risk-taking, in keeping with a stronger lower-order preference 
for unexpected (low probability) rewards. We assumed that opti- 
mal integration of these signals in ventromedial PFC in healthy 
controls would manifest as a stronger correlation with dorsolateral 
PFC activity than with ventral striatal activity (i.e. final valuation 
being more contingent on the higher-order goal of being success- 
ful in the long run with safe gambles). We predicted that patients 
with bipolar disorder would show the opposite pattern, with 
ventromedial PFC activity correlating more strongly with ventral 
striatum, consistent with final valuation being more contingent on 
lower-order preference. A separate set of analyses looked at the 
impact of state fluctuation in symptoms. 

Materials and methods 

Participants 

Twenty patients with bipolar disorder in remission were case-matched 
with 20 healthy control subjects by age, gender and level of education 
(Table 1). Key inclusion criteria were age 18-45 years, no current 
alcohol problem (weekly intake ^25 units) or substance use in the 
past 4 months. In addition we excluded participants that had received 
antipsychotic medication in the past 6 months to reduce the effect of 
medication on reward -related activations (Pessiglione et al., 2006; 
Abler et al., 2007b). 

Diagnosis was established using the Structured Clinical Interview for 
DSM-IV Axis I Disorders (SCID; First et al., 2002), with remission 
defined as not meeting the criteria for manic or depressive episodes 
in the past 2 months. Residual symptoms of depression and mania 
were assessed using the 17-item Hamilton Depression Rating Scale 
and 12-item Bech-Rafaelsen Mania Scale, respectively. Informed writ- 
ten consent was obtained from all participants, and the study was 
conducted in accordance with the Declaration of Helsinki. Additional 



details of the recruitment procedure are reported in the online 
Supplementary material. 

To generate behavioural measures of real-life impulsivity and 
risk-taking that could be used in further validating the association of 
dorsolateral PFC and ventral striatum activity with impulsivity and 
risk-taking, participants completed the 11 -item version of the Barratt 
Impulsivity Scale (Patton et al., 1995) and the Domain-Specific 
Risk-Taking Scale (Blais and Weber, 2006). 

Task 

Participants played a modified version of a previously validated 
Roulette task (van Eimeren et al., 2009) comprising three time 
stages (Fig. 1A): selection, anticipation, and outcome (see 
Supplementary material for timing information). In this task, 
participants select between four choices that are equivalent in terms 
of available knowledge of reward probability, with the only difference 
being colour. Two variables were manipulated: probability of reward 
and magnitude, and each condition was presented as a separate trial, 
rather than as opposing choices within a given trial. Given that 
patients with bipolar disorder behave differently in decision-making 
tasks (Swann et al., 2003; Adida et al., 2011), this design precluded 
the eventuality of a mismatch in the numbers of responses for each 
condition. In the low probability (25% reward; 'risky') conditions, the 
four options were any one of the four colours that made up the 
roulette wheel. In the high probability conditions (75% reward; 
'safe'), participants chose between four sets of three colours each, 
and won if the Roulette wheel stopped on any of the three colours 
in the set that they chose. The stake was also fixed, and varied equally 
between trials of low (£3) and high (£9) magnitude. The magnitude at 
stake was presented during the selection phase. During the 
anticipation phase, the wheel spun. It stopped spinning at the 
outcome phase to reveal whether the participant had won or lost. 
Losses were penalized by the magnitude that had been at stake on 
the particular trial. 

Participants were instructed to respond within the fixed selection 
time and informed that a random choice would be automatically 
made if a timely response was not issued. Participants completed a 
total of 272 trials over eight runs (~6min each), with probability and 
stake being equally distributed across each run. Participants were in- 
formed that they would be paid the actual winnings from the task at 
the end of the experiment. 

Functional magnetic resonance imaging 
data analysis 

Standard functional AARI data acquisition and preprocessing are 
described in detail in the Supplementary material. We modelled the 
influence of motivational factors at anticipation and outcome. At an- 
ticipation, reward probability (25%, 75%) and magnitude (£3, £9) 
were modelled. At outcome, the factors were valence (gain, loss) 
probability (low, high) and magnitude (£3, £9). The selection phase 
was modelled as a regressor of no interest. We did not model motiv- 
ational factors for this phase because of confounds from motor re- 
sponse and the short duration of this phase (1.5 s), which would 
reduce the power to reliably disambiguate neural activities related to 
this stimulus. Group (bipolar disorder or healthy control) was entered 
as a between-groups factor in each analysis of variance. Current 
depressive and manic symptoms were included as covariates in a sep- 
arate analysis of covariance. A separate analysis of variance examined 
the trial-wise modulation of brain activity in relation to expected value 
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and prediction error. Due to space constraints, both this analysis and 
those exploring interaction with mood symptoms are reported as 
Supplementary material. 

The Harvard -Oxford probabilistic atlas (http://www.fmrib.ox.ac.uk/ 
fsl/) was used to generate region of interest masks for bilateral nucleus 
accumbens and the medial frontal gyrus portion of dorsolateral PFC. 
The region of interest mask for the ventromedial PFC, unavailable in 
the Harvard -Oxford atlas, was functionally defined from prior publi- 
cation (Hare et al., 2009), using 10 mm spheres around peak coord- 
inates reported. The mean signal (beta values) across voxels in these 
regions of interest for individual trials were extracted for each contrast 
using the MarsBaR SPAA toolbox (http://marsbar.sourceforge.net/) 
and exported to Statistical Package for the Social Sciences (SPSS) for 
all further analyses. This enabled a range of additional analyses 
(including post hoc tests) to be performed within a single and 
widely used package, as well as enabling versatile graphs and plots 
to be generated. 

Table 1 Demographics and behavioural data 



Modulation of ventromedial prefrontal 
cortex valuation by dorsolateral 
prefrontal cortex and ventral striatum 

We evaluated the degree to which activity in ventral striatum (lower- 
order preference) and dorsolateral PFC (long-term goal) predicted ac- 
tivity in ventromedial PFC (weighted by both lower-order preference 
and the long-term goal), and how this differed between groups. To 
address this question we used a simplified regression model restricted 
to just these three key brain regions, which were identified a priori by 
previous theoretical and empirical work (Hare et al., 2009, 2011). In 
our analysis, outcome-locked activity in ventromedial PFC in response 
to gains only was the dependent variable, with (gain) outcome- 
locked activities in ventral striatum and dorsolateral PFC as 
covariates and group was a fixed factor. Although directionality 
cannot be inferred from our analysis, previous work using causal 





Remitted bipolar disorder 


Healthy controls 
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0.34 


Female 


i u/zu 




11/20 




x 2 (D = o.i 
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4/20 












HRSD-17 


3.82 


3.04 


0.63 


1.06 


£(38) = -4.45 


^0.001 


MAS-12 


3.23 


2.78 


0.40 


1.13 


£(38) = -4.21 


^0.001 


VAS-Anxiety 


3.00 


6.55 


1.42 


4.23 


£(37) = -0.889 


0.38 


VAS-Sadness 


-2.61 


13.60 


-1.58 


5.01 


£(37) = 0.232 


0.82 


Response time (ms) 


724.5 


188.0 


741.6 


191.8 


£(38) = 0.285 


0.78 


Response (%) 














Choice 1 


25.1 


4.51 


25.0 


5.48 


£(38) = -0.07 


0.95 


Choice 2 


27.9 


6.40 


25.6 


5.04 


£(38) = 1.23 


0.23 


Choice 3 


24.3 


4.13 


24.9 


4.45 


£(38) = -0.49 


0.63 


Choice 4 


17.8 


5.02 


19.2 


4.34 


£(38) = -0.89 


0.38 



Patients with bipolar disorder did not differ from the controls in their response time or proportions of each option chosen. 

*SUD = substance use disorder; GAD = generalized anxiety disorder; OCD = obsessive-compulsive disorder; HRSD = Hamilton Depression Score; MAS = Bech-Rafaelsen 
Mania Score; VAS = visual analogue scale score (-50 to +50); BD = bipolar disorder; SSRI = selective serotonin reuptake inhibitor; SNRI = serotonin-norepinephrine 
reuptake inhibitor. 
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modelling (Hare et al., 2011) highlights ventromedial PFC as being 
confluence of valuation signals that receives ventral striatum and 
dorsolateral PFC inputs. A second analysis step assessed state-related 
modulation of the main effects by adding residual affective symptoms 
(scores on the Hamilton Depression Rating and Bech-Rafaelsen Mania 
Scales) to the above model as covariates. Correlations explored signifi- 
cant effects identified by this ANCOVA. Region of interest masks were 
an average across hemispheres, unless there was a main effect of 
hemisphere. In this latter eventuality, mean activity in the maximal 
hemisphere was taken, to increase statistical power. 

Effects of trait impulsivity and 
risk-taking on the processing of safe 
and risky gains 

Repeated measures ANOVA were conducted on neural activity in re- 
sponse to gain outcomes in ventral striatum, dorsolateral PFC and 
ventromedial PFC separately. Probability and group were fixed factors, 
and total impulsivity (Barratt Impulsivity Scale) and risk-taking 
(Domain-Specific Risk-Taking Scale) scores were entered as covariates. 



Significant interactions with covariates were followed up with partial 
correlation, controlling for group. 



Results 

Clinical and behavioural data 

Demographics, clinical variables, medications, and behavioural re- 
sults are reported in Table 1 . Patients with bipolar disorder showed 
higher levels of state-related symptomology in depression 
(Hamilton Depression Rating Scale) and mania (Bech-Rafaelsen 
Mania Scale) scores relative to healthy control subjects (although 
all in the low range of ^8). Patients also scored higher on trait 
impulsivity [total score: £(1 ,38) = 5.86, P < 0.001 ; motoric 
subscale: P = 0.06; non-planning: P = 0.089] and risk-taking be- 
haviours [total score: £(1 ,38) = 2.13, P = 0.04], reflecting the 
higher levels of behavioural impulsivity and risk-taking in bipolar 
disorder. 



11.25s 
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Figure 1 Trial schematic and associated neural activity. (A) Participants made bets on which colour would win in a Roulette gamble. The 
trial sequence comprised three phases: selection; anticipation while the wheel spun; and outcome evaluation when the ball stopped on one 
of the colours, signifying the delivery of the reward or loss. (B) Whole-brain analysis of probability and valence during anticipation and 
outcome, respectively. Dorsolateral prefrontal cortex activated more during anticipation of high compared to low reward probability 
(cluster threshold P = 0.011 uncorrected, see Supplementary Table 1). At outcome, the nucleus accumbens was more active for rewards 
than losses (left and right cluster threshold P < 0.05 familywise error corrected). 



A reward-functional MRI study of patients not on antipsychotic medication 



Brain 2014: 137; 2346-2355 I 2351 



Functional magnetic resonance 
imaging findings 

This section reports the a priori region of interest analyses (see 
Supplementary Tables 1-3 for activated foci from a whole-brain 
analysis). The analyses are reported in the following order: (i) 
group comparisons; (ii) modulation of valuation by ventral and 
dorsal systems; and (iii) trait effects of impulsivity and risk- 
taking. The main analysis of state effects of residual affective 
symptoms on activity is reported as Supplementary material. 

Group comparisons 

Anticpation: The dorsolateral PFC region of interest showed main 
effects of reward probability [F(1 ,38) = 12.64, P< 0.001] and 
magnitude [F(1 ,38) = 6.91 , P = 0.012] as well as a probability by 
group interaction [F(1 ,38) = 4.1 , P = 0.05]. Across all participants, 
the dorsolateral PFC was more active for prospects that afforded a 
high probability of reward, and for prospects of larger magnitude. 
This is consistent with activity in dorsolateral PFC aligning with the 
higher-order goal of maximizing rewards. In the bipolar disorder 
group, however, the effect of probability was reduced, such that 
these patients showed a smaller increase in dorsolateral PFC acti- 
vation for high-probability (relative to low-probability) prospects 



[£(19) = -4.61, P < 0.001], compared to controls [£(19) = -0.96, 
P = 0.35]. 

In ventral striatum, a three-way group by probability by hemi- 
sphere interaction approached significance [F(1 ,38) = 3.81 , 
P = 0.058], which indicated that in left ventral striatum, the in- 
crease in activity for high-probability rewards (relative to low) was 
more pronounced in the bipolar disorder compared to healthy 
control group. 

The ventromedial PFC region of interest showed no main effects 
of interactions (P-values ^ 0.16). 

Outcome: Task-related effects in the dorsolateral PFC region of 
interest included effects of valence [F(1 ,38) = 19.2, P< 0.001], 
hemisphere [F(1 ,38) = 6.16, P = 0.018], and a trend for magnitude 
[F(1,38) = 3.55, P = 0.067]. These effects indicated that dorsolat- 
eral PFC activation was (i) greater for gains than losses; (ii) greater 
for large than small outcomes; and (iii) greater in right dorsolateral 
PFC, across conditions, relative to left. 

A three-way valence x probability x group interaction 
[F(1 ,38) = 4.96, P = 0.03] also emerged (Fig. 2). Follow-up 
ANOVAs on gain and loss outcomes separately showed that the 
interaction between probability and group was specific to gain 
outcomes [P(1 ,38) = 5.62, P = 0.023; losses, P = 0.58). 7-tests 
showed that the groups differed in high probability gains 
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Figure 2 Abnormal dorsolateral prefrontal cortex (PFC) activity in bipolar disorder correlates with risk-taking. Left: Effect of probability 
(High > Low) in whole-brain analysis of gamble outcomes. Right: Blood oxygen level-dependent signal from a priori region of interest 
analysis. Top right: Whereas controls preferentially activate dorsolateral PFC for high-probability (safe) reward outcomes, the bipolar 
disorder group show the opposite pattern, activating this region more for low-probability (risky) rewards. Bottom right: Greater real-life 
risk-taking in bipolar disorder group is associated with a reduced response to safe reward outcomes in right dorsolateral prefrontal cortex 
[r= -0.629, P = 0.005]. 
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[£(38) = 2.46, P = 0.019; low probability, P = 0.84], with greater 
dorsolateral PFC activity observed in controls. 

In the ventral striatum region of interest, task effects of valence 
[F(1 ,38) = 53.7, P< 0.001], magnitude [F(1 ,38) = 5.51 , P = 0.024] 
were significant, in addition to valence x probability 
[F(1 ,38) = 7.71 , P = 0.008] and valence x magnitude 
[F(1,38) = 4.05, P = 0.051] interactions. These effects indicated 
greater activity for gains than losses (Figs 1B and 3), and for 
large compared to small outcomes. As expected, the interactions 
signified that low probability and large magnitude outcomes 
increased ventral striatum activity for gains, but reduced activity 
for losses. These findings are consistent with the ventral striatum 
playing a role in the savouring of rewards, particularly those that 
are unexpected and large. An overall effect of group was signifi- 
cant [F(1,38) = 4.51, P = 0.04], and group interacted with valence 
[F(1,37) = 4.06, P = 0.05]. Separate ANOVAs for gain and loss 
outcomes showed that the groups only differed in ventral striatal 
activity for gain outcomes [F(1 ,38) = 5.16, P = 0.029; losses, 
P = 0.17], with the stronger response in the bipolar disorder pa- 
tients signalling greater response to rewards (Fig. 3). 

Task-related effects in the ventromedial PFC region of interest 
included effects of valence [F(1,38) = 28.6, P < 0.001], probability 
17(1,38) = 3.91, P = 0.05], in addition to an interaction between 
valence and probability [P(1 ,38) = 15.0 P< 0.001]. As per the 
ventral striatum findings, follow-up ANOVAs on gain and loss 
outcomes separately showed that an effect of probability in the 



ventromedial PFC was specific to gain [P(1,38) = 13.6, P < 0.001; 
losses, P = 0.15]. This finding shows that the ventromedial PFC 
was most responsive to low-probability rewards. Also resembling 
the pattern within ventral striatum, valence interacted with group 
[P(1 ,38) = 4.62, P = 0.038]. Follow-up Mests on gain and loss out- 
comes showed that, relative to controls, patients with bipolar dis- 
order responded more strongly to gains [£(1 ,38) = 1 .71 , P = 0.08] 
but not losses [£(1 ,38) = 0.14, P = 0.89], consistent with a stronger 
lower-order preference for rewards. 

Modulation of valuation by dorsolateral prefrontal 
cortex and ventral striatum 

The aim of this analysis was to see how activity in ventral striatum 
and dorsolateral PFC related to activity in ventromedial PFC during 
processing of gain outcomes, and whether this differed across 
groups. Outcome-locked activity in ventromedial PFC in response 
to gains was entered as a dependent variable in an ANCOVA, 
with outcome-locked activity in the ventral striatum and dorsolat- 
eral PFC in response to gains as covariates, and group as a fixed 
factor. 

Activity in dorsolateral PFC interacted with ventral striatum 
[F(1,39) = 3.45, P = 0.01]: stronger combined activity between 
dorsolateral PFC and ventral striatum predicted greater activity in 
ventromedial PFC. A combination of outcomes that were subject- 
ively preferred in addition to being perceived to promote the 
higher-order goal of maximizing outcomes promoted strongest 
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Figure 3 Hyper-hedonic response to reward in bipolar disorder. Top: Activity in ventral striatum and the ventromedial prefrontal cortex in 
response to gains [Gain > Loss contrast across all subjects (whole-brain analysis; P < 0.05, familywise error corrected)]. Bottom: blood 
oxygen level-dependent signal from a priori region of interest analysis of left and right ventral striatum for gain and loss outcomes. After a 
Roulette gamble, patients with euthymic bipolar disorder show a hyper-hedonic response to winning, and a smaller response to losing. 
This may result in greater swaying by the 'feel-good' highs and less influence of the potential hazards of risky choices. Asterisk indicates 
significant group difference (P ^ 0.05). 
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activity in ventromedial PFC. Group separately interacted with 
ventral striatum [F(1 ,39) = 7.32, P = 0.01] and with dorsolateral 
PFC [F(1 r 39) = 12.1, P< 0.001], indicating that the two groups 
differed in strength or sign of the association between activity in 
these two regions and ventromedial PFC. Follow-up correlations 
were used to assess these possibilities. With regard to the relation- 
ship between ventromedial PFC and ventral striatum activations, 
the correlation was stronger for patients [r(20) = 0.662, 
P< 0.001] than controls [r(20) = 0.464, P = 0.04], although this 
difference did not reach significance with follow-up testing 
(Fisher's Z = 0.86, P = 0.19). With regard to the relationship be- 
tween dorsolateral PFC and ventromedial PFC, the correlation was 
positive in controls [r(20) = 0.704, P< 0.001]; however, it was 
negative in the bipolar disorder group [r(20) = -0.478, 
P = 0.03]. These correlations significantly differed between 
groups (Fisher's Z = 2.78; P < 0.001; Fig. 4). 

The above analysis was repeated, but only in the bipolar dis- 
order group, with depression and mania symptom scores included 
in the model. Mania interacted with ventral striatal activity 
[F(1,13) = 9.39, P = 0.009]. No other effects or interactions reach- 
ing significance. Subsequent partial correlations (controlling for 
depression scores) were performed using the predicted scores of 
the interaction (mania with ventral striatal activity) on ventro- 
medial PFC. These showed that mania score augmented the posi- 
tive relationship between ventral striatal and ventromedial PFC 
activities 1/(17) = 0.636, P = 0.003 for correlation between ventral 
striatum and ventromedial PFC; r(17) = 0.842, P < 0.001 for the 
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Figure 4 Outcome valuation in bipolar disorder is driven more 
by ventral striatal than dorsal prefrontal cortical signals. In con- 
trols, ventromedial prefrontal cortical (vmPFC) activity showed 
moderate positive associations with both ventral striatum and 
with dorsolateral prefrontal cortex (dIPFC), consistent with 
comparable influences of both lower-order (ventral pathway) 
and higher-order (dorsal pathway) goals. In contrast, the bipolar 
disorder group showed a strong positive association in the 
ventral pathway (between ventral striatum and ventromedial 
PFC), and an inverse association in the dorsal pathway (between 
dorsolateral PFC and ventromedial PFC), suggesting that deci- 
sion-making is influenced more by lower-order rather than 
higher-order aspects. Green and red lines denote positive and 
negative correlation, respectively. Thickness of connecting line 
denotes the strength of correlation coefficients. 



ventral striatal interaction with mania and ventromedial PFC]. This 
is consistent with state mania augmenting the role that lower- 
order preference plays in the evaluation of gain outcomes in 
ventromedial PFC. 

Effects of trait impulsivity and risk-taking on the 
processing of safe and risky gains 

As hypothesized, interactions emerged between personality traits 
(impulsivity, risk-taking) and neural activity related to reward 
probability. The probability by risk-taking interaction was signifi- 
cant in dorsolateral PFC [F(1 ,37) = 3.89, P = 0.05] and marginally 
significant in ventral striatum [F(1 ,37) = 3.86, P = 0.057]. 
Although in the direction hypothesized, the probability by impul- 
sivity interactions failed to reach significance in dorsolateral PFC 
(P = 0.07) and ventral striatum (P = 0.13). There were no further 
effects or interactions in these structures, or in ventromedial PFC 
(P>0.31). 

To explore the above interactions, correlations were performed 
between trait measures and the neural activity in response to low- 
probability and high-probability gain outcomes, separately. As pre- 
dicted, the neural response to high-probability gains was nega- 
tively correlated with impulsivity in dorsolateral PFC 
[r(40) = -0.36, P = 0.023)] and with risk-taking score in both 
dorsolateral PFC [r(40) = -0.479, P = 0.002; Fig. 2] and ventral 
striatum [r(40) = -0.50, P< 0.001]. These correlations remained 
significant when controlling for group (partial correlation 
P ^ 0.02). Neither trait impulsivity nor risk-taking was related to 
activations for low-probability gains (P ^ 0.24). 

Discussion 

This study sought to specify the neural basis of suboptimal deci- 
sion-making in bipolar disorder through examining activity during 
the anticipation and experience of safe and risky prospects. Recent 
research into the functional anatomical basis of optimal decision- 
making demonstrates interplay between frontostriatal systems. 
Whereas longer-term goals are represented in dorsolateral PFC 
and lower-order preference in ventral striatum, ventromedial PFC 
has been proposed to integrate these signals into a weighted valu- 
ation that ultimately drives subsequent behaviour (Plassmann 
et al., 2006). Our findings suggest that in bipolar disorder, and 
potentially other disorders characterized by impulsivity, the 
weighting of these signals (in ventromedial PFC) may be biased 
towards the ventral striatal contribution, and away from the 
dorsolateral PFC signal. In this way lower-order, strongly desired 
outcomes are favoured above and beyond those that fit with the 
long-term goal (Fig. 4). 

Patients with bipolar disorder evidenced hyperactivation of ven- 
tral striatum both during anticipation and experience of rewards. 
When anticipating outcomes, these patients showed a greater in- 
crease in left ventral striatal activation for high reward probability 
gambles, compared with control subjects. This finding fits with 
recent research both in euthymic patients (Nusslock et al., 2012) 
and in people vulnerable to bipolar disorder (O'Sullivan et al., 
2011), and may indicate that when rewards are likely to be avail- 
able, this group have a greater drive to obtain them compared 
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with control subjects. This may explain their elevated levels of 
goal-striving and willingness to expend effort to obtain reward 
(Johnson et al., 2012). Subsequently, when processing reward 
outcomes, patients with bipolar disorder showed hyperactivation 
of both ventral striatum and ventromedial PFC, compared with 
control subjects. This fits with our hypothesis that a trait feature 
of bipolar disorder is a greater hedonic impact of rewards, thereby 
making reward-seeking behaviours more enticing. Ventral striatal 
activation is associated with selecting immediate over delayed re- 
wards (McClure et al., 2004), and so the dominance of this signal 
on integrated valuation (ventromedial PFC) reported here may be 
causally related to the impulsive delay discounting trajectory pre- 
viously reported in bipolar disorder (Strakowski et al., 2010; 
Mason et al., 2012/?). Although further work will be needed to 
examine this possibility, manic symptoms in the present study 
were associated with a strengthening of the association between 
ventral striatum and ventromedial PFC. This provides a link to the 
increases in impulsive and unrestrained reward-seeking behaviour 
associated with mania. 

The divergent profile of activity in dorsolateral PFC between 
groups further points towards an overvaluation of lower-order 
goals in bipolar disorder. Consistent with our hypothesis, controls 
preferentially activated dorsolateral PFC for rewards of high (rela- 
tive to low) probability — both during anticipation and delivery 
stages. These trials likely held the greatest attentional relevance 
in terms of the overarching goal of maximising earnings across the 
task, adding to past research that links dorsolateral PFC to the 
pursuit of longer-term or superordinate goals (Hare et al., 2009; 
Staudinger et al., 2011; Diekhof etal., 2012). In contrast, patients 
with bipolar disorder showed the opposite pattern of results to 
controls, with dorsolateral PFC preferentially responding to low- 
probability (i.e. more risky) rewards. This provides initial evidence 
that in bipolar disorder, this dorsal control system fails to suppress 
or devalue behaviours associated with immediate payoff in favour 
of those that fit with a longer-term, superordinate goal (Hare 
et al., 2009; Staudinger et al., 2011; Diekhof et al., 2012). This 
interpretation is further validated by the inverse association be- 
tween dorsolateral PFC activation in response to safe rewards 
and both impulsivity and risk-taking traits (Fig. 2). From this find- 
ing it is reasonable to infer that other disorders characterized by 
impulsivity, both psychiatric and neurological, may be similarly 
characterized; by absent or reduced dorsolateral PFC-mediated 
upregulation of behaviours affording to longer-term or higher- 
order goals. Further research will be needed to examine this 
possibility. 

Finally, although dorsolateral PFC activity was positively correlated 
with that of ventromedial PFC in controls, these two regions were 
negatively correlated in bipolar disorder group. This fits with an exist- 
ing finding in euthymic patients of both reduced frontopolar (dorsal) 
cortical activation and increased ventromedial PFC activation during 
a gambling task (Jogia et al. , 201 2). Collectively, these findings sug- 
gest that valuation of outcomes in bipolar disorder is biased in favour 
of lower-order preference, even when this conflicts with the long- 
term goal of maximizing winnings (Fig. 4). 

Although we attempted to reduce the impact of medication on 
reward -related neural activity by excluding antipsychotic medica- 
tion, future work should examine reward processing in 



medication -naive participants. Further, rates of (historic) alcohol 
and substance use were higher in the bipolar disorder group. 
Because addiction has been associated with differences in reward 
processing in its own right (Hommer et al., 2011), it cannot be 
ruled out that this also contributed to the group differences. 
However, supplementary analyses failed to show any effect of 
substance use history. It has also been suggested that bipolar dis- 
order and addiction have shared vulnerability factors (Alloy et al., 
2009). As with many reward tasks in the literature, the task em- 
ployed in the present study could not disentangle reward prob- 
ability from surprise. Recent work has shown that these processes 
may be coded by distinct regions of the human ventral midbrain 
(Boll et al., 2013) and future work will be needed to delineate 
these processes in disorders characterized by reward dysregulation. 

In summary, we report evidence that in bipolar disorder both 
ventral (striatum) and dorsal (dorsolateral PFC) frontostriatal 
reward systems attend to lower-order goals, overvaluing outcomes 
that are strongly desired, but suboptimal in the long run. These 
findings strongly suggest that bipolar disorder cannot be reduced 
to affective instability alone, highlighting the centrality of goal 
regulation in understanding the impulsive and risky decision- 
making that spans the course of the disorder. Our findings may 
also have theoretical implications for understanding other impul- 
sivity disorders, as well as several implications for clinical interven- 
tion. First, they suggest that psychotherapeutic interventions might 
be enhanced by specifically attending to issues of goal regulation. 
Second, the neurophysiological markers of poor goal regulation 
that we have identified in this study, if confirmed by later inves- 
tigations, suggest some targets for novel psychological and 
pharmacological treatments. Moreover, third, these markers may 
be useful in evaluating both kinds of interventions. In particular, 
interventions that bolster dorsolateral PFC-mediated cognitive 
control may be an important direction for future research. 
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